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the combined similarity inforaiation, an objective function is iteratively maximized in order 
to yield a generalized similarity value that expresses the similarity of particular pairs of 
documents. In an embodiment, the generalized similarity value is used to determine the 
proper category, among a taxonomy of categories in an index, cache or search system, into 
v^hich certain documents belong. 



In the Claims: 



In the claims, please amend claims 1, 19, 20, 21, 28, 29 and 34. Please delete claims 



35-37. 
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A method of categorizing a plurality of nev^ electronic documents into a 
Set of dategories, comprising the steps of: 

esfablisning a plurality of training sets, wherein each training set is associated v^ith a 
category and includes training documents that have been classified as 
belonging to said associated category; 

determiningViow strongly each document of said plurality of documents corresponds 
to each of said plurality of categories by determining similarity between said 
each document and the training documents that belong to the training set of 
said^categpry; and 

wherein the step oXdetermining similarity is performed using a matrix representing 
document similarity that is derived by combining two or more measures of 
dncuTTTgr)^ simiVirity 



1 19. (Amended) A method as recited in Claim 1, further comprising the step of extracting 

2 similarity information from the similarity matrix to obtain new documents supported 

3 by the set of training documents for each category. 
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1 20. (Amended) A method as recited in Claim 19, wherein the similarity information is 

2 obtained by optimizing an objective function. 

1 21. (Amended) A method as recited in Claim 19, wherein the similarity information is 

2 obtained by only approximately optimizing an objective function. 

1 28. (Amended) A method as recited in Claim 1, further comprising the steps of creating 

2 and storing a second matrix using columns representing documents and rows 

3 representing user sessions, and wherein values of elements of the second matrix 

4 represent interest in a document shown by a particular user in a particular session. 

1 ^-d^ (Amended) A method as recited in Claim 1, further comprising the steps of creating 

2 and storing a matrix using columns representing user sessions and rows representing 

3 documents, and wherein values of elements of the second matrix represent interest in 

4 a document shown by a particular user in a particular session. 
^^^^ 

(An\ended) A computer-readable medium carrying one or more sequences of 

2 strufctions, wherein execution of the one or more sequences of instructions by one or 

3 ^ m\^ processors causes the one or more processors to perform the steps of: 

4 establishing a plurality of training sets, wherein each training set is associated with a 

5 category and includes training documents that have been classified as 

6 beloWing to said associated category; 

7 determining how strongly each document of said plurality of documents corresponds 

8 to each of said plurality of categories by determining similarity between said 

9 each document and the documents that belong'to the training set of said 
10 category; and 
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1 1 wherein the step of determining similarity is performed using a matrix representing 

12 document similarity mat is derived by combining two or more measures of 

13 document similarity. \ 

1 35. (Cancel) A i«ethod of categorizing a plurality of new electronic documents for use in 

2 a hypertext search system, the method comprising the steps of: 

3 creating and stoirng a set of categories for the documents; 

4 creating and storinaa matrix, in which rows and columns identify documents, and in 

5 whicliyach elWent of the matrix stores a value that represents a similarity 

6 among a pair ofuocuments associated with a row and column that intersect at 

7 the element; \ 

8 deriving each matrix value D\y combining two or more measures of similarity that are 

9 obtained by analysis oY the documents. 

1 36. (Cancel) A method as recited in Gllaim 35, further comprising the steps of: 

2 for each measure of document simNarity, creating and storing a graph of links; 

3 creating and storing a combined graph that combines the graphs and that represents a 

4 generalized similarity of the documents; 

5 computing a generalized similarity valute for a pair of documents based on the 

6 combined graph. \ 

1 37. (Cancel) A method as recited in Claim 36, farther comprising the steps of 

2 classifying unclassified documents into category nodes of a taxonomy structure 

3 associated with the hypertext search system based on the generalized similarity 

4 value in combination with a comparison of a set bf pre-classified training set of 

5 documents with a set of unclassified documents, to carry out classification. 
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